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A EUKARYOTIC EPISOMAL DNA CLONING AND EXPRESSION VECTOR 



This invention relates to the development of 
recombinant eukaryotic cloning and expression vectors 

15 based on unique regulatory elements isolated from 

autonomously replicating, stable episomal units isolated 
from human tumor cell lines. More specifically, the 
unique regulatory elements include origins of DNA 
replication, and DNA sequences that confer 

20 extrachromosomal stability and maintenance. These unique 
episomal regulatory elements permit large pieces of DNA to 
be expressed or cloned (greater than 50 kilobase pairs 
[kb] in size) . 

25 During the past decade, the underlying significance 

of recent advances in molecular biology has been the 
ability to clone and manipulate DNA from virtually any 
source by ligating restriction fragments into phage or 
plasmid vectors which are then replicated in E . coll. 

30 

Since then, a crucial technological gap has developed 
in what is commonly called "conventional recombinant DNA 
technology." This technological gap stems from two 
developments. The first is the discovery that many 
35 eukaryotic genes are encoded by enormous lengths of DNA. 

The second is an optimistic and enthusiastic goal of 
mapping and sequencing entire genomes, including the human 
genome . 
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Because of the large size of DNA in many genes from 
higher organisms, this size limitation and restriction can 
be stifling. For example, hi thorax locus in Dropsophila, 
which plays an active role in the fly's segmentation 
5 pattern, encompasses approximately 320 kb (Karch, et al., 

Cell 43:81, 1985). Factor VXI1 gene in the human which 
encodes the blood-clotting factor deficient in 
hemophiliacs, spans at least 190 kb (Gitschier, et al., 
Nature ( London 1 . 312:326, 1984). The gene that is 

10 defective in Duchenne's muscular dystrophy is estimated to 
include more than a million base pairs (1000 kb) . A 
striking feature of this gene is the protein-coding 
portion may be encoded by as little as 15 kb of DNA 
(Monaco, et al, Nature f London^ . 302:575, 1983). Thus, 

15 there is a strong need for technological advances which 

permit the cloning and expression of very large genes. 

Also widening this technological gap is the increased 
interest in and enthusiasm for gene replacement therapy. 

20 Proposals to use genes to treat cancer and immune 

deficiencies have only recently been approved by the 
National Institutes of Health human gene therapy 
subcommittee and the Recombinant DNA Advisory Committee 
( Science , 249:974, August, 1990). These first studies 

25 focus on: 

(1) delivering tumor necrosis factor (TNF) directly 
to a tumor site in much larger doses by 
packaging the gene for TNF inside special 

30 lymphocytes that have a natural 

affinity for tumors; and 

(2) attempting actual gene replacement therapy in 
children with a rare, inherited and often lethal 

35 immune system disorder caused by adenosine 

deaminase deficiency. A normal healthy 
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recombinantly produced ADA gene will be 
introduced into the white blood cell of an ADA 
deficient child and the cells are then returned 
to the patient ( Id. 975) . 

5 

To narrow this gap, molecular biologists are 
attempting to clone large pieces of exogenous DNA into 
compatible hosts by means of artificial vectors. However, 
standard recombinant DNA techniques, that involve the 

10 construction of small plasmid vectors that can be 

transf ected into host cells and clonally propagated, are 
limited in the amount of exogenous DNA that can be 
"squeezed" or inserted into these vectors. These size 
restrictions only permit about 50 kilobase pairs (kb) to 

15 be cloned into the vectors usually employed in cloning. 

More limitations exist when the discussion turns to 
the bacterial expression of mammalian proteins. The 
current technology for expressing mammalian proteins in 
20 bacteria is hampered with problems relating to post 

translational modifications and functional bioactivity. 

To date, cloning of large segments of exogenous DNA 
in the range of several hundred kilobase pairs has only 

25 been achieved by employing yeast. This was done by 

ligating exogenous DNA to vector sequences that allow 
their propagation as linear artificial chromosomes (Burke, 
et al, Science , 236:806, 1987). Although this technique 
is a significant step towards resolving this size 

30 restriction, cloning large segments of exogenous DNA into 
yeast is not without limitations. Questions and concerns 
about this technology pertain to (1) the stability of the 
recombinants, (2) whether clone banks are representative 
of the starting material, (3) whether the desired protein 

35 is consistently expressed in extrachromosomal vectors, and 
(4) whether normal human transcripts are properly 
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processed in yeast, as well as, whether proper expression 
and post translations! modification of the recombinant 
protein occurs in yeast. 

5 Therefore, with the yeast expression system and its 

limitations, there is still a very strong need to design 
and construct eukaryotic expression and cloning vectors 
possessing the capabilities of housing very large regions 
of DNA (greater than 50 kb) and of accurately processing 
10 and expressing of these large genes. With such a novel 

vector, large regions of DNA that span genes can then be 
cloned and whole proteins encoded by the genes can then be 



15 One mechanism by which a cell can accumulate large 

amounts of specific protein or RNA is by amplification of 
the respective gene. This amplification may be located on 
either expanded chromosomal regions (homogenous staining 
regions) or on extrachromosomal autonomously replicating 

20 elements (called double minute, double minute chromosomes 

or episomes) . 



Episomes have unique features; the most notable are 
that episomes autonomously replicate and are stably 

25 maintained extrachromosomal ly. The characteristics of 

episomes permits the continuous production of the 
respective amplified gene and the gene products it 
encodes. For example, an episome produced in hamster 
cells has been characterized to contain amplified amounts 

30 of a transfected CAD (CAD is an acronym for the 

multifunctional protein containing carbamylphosphate 
synthetase, aspartate transcarbamylase , and 
gihydroorotase) gene at high frequency (Carrol, et al., 
MQXecular and Cellular Biology. 7(5):1740, 1987). The 

35 amplified CAD gene is produced with each division of each 

cell. 
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Viral episomes have also been id ntified. It has 
been demonstrated that papilloma viral DMA replicates like 
a plasmid in mouse cells. Circular bovine papilloma virus 
(BPV) DNA can transform certain mouse cell lines to a 
5 malignant phenotype. In these transformed cell lines, the 
BPV DNA remains circular and extrachromosomal at about 30 
- 100 copies per cell. This -plasmid" is being stably 
maintained in higher eukaryotes. Desired genes may be 
inserted into the BPV DNA and be maintained in the 

10 plasmid-like state and high levels of mRNA and protein 

corresponding to the desired gene can be produced. It has 
also been shown that Epstein-Barr virus vectors contain 
sequences that provide extrachromosomal stability of 
episomal DNA as well as origins of replication. This 

15 viral vector has been used to identify human DNA sequences 
that permit autonomous replication in human cells (Krysan, 
ct al., ffrl— ™ d r»nul«r Biology, 9(3):i026, 1989). 
But, it can be appreciated that there are many limitations 
when working with a virally produced protein. For 

20 example, in terms of producing proteins that may 

ultimately be used to replace defective human genes, viral 
episomes probably are not feasible because of potential 
Food and Drug Administration regulations, etc. Also the 
viral episome eventually integrates into chromosomal sites 

25 which then interferes with continued amplification and 
causes the expression of its resident genes to be 
extinguished. 

Thus, the limitations in terms of integration into 
30 chromos mal sites and of potential hazards pertaining to 
the use of viral based vectors for amplification and 
expression apply to all eukaryotic viral episomes. 

It is the intent of this invention to describe a 
35 eukaryotic cloning and an expression vector which will 
accommodate genes that exceed the cosmid limit (greater 
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than 50 kb) and permit their accumulation and maintenance 
as autonomously replicating extrachr omosoma 1 elements in 
mammalian cells. This invention is therefore unique by 
providing autonomous replication and expression of large 
5 genes in a vector containing episomal regulatory elements. 

This minimal cloning or expression vector will be 
further modified by the inclusion of regions of human 
chromosomes containing telomeres and centromeres. This 

10 would thus create a human artificial chromosome that would 
be subjected to the same control mechanisms (regarding 
regulation and chromosomal segregation) as normal 
chromosomes and therefore serve as a vehicle for gene 
replacement therapy. This modification of the 

X5 extrachromosomal vector is therefore unique in that it 

will be a synthetic chromosome containing genes of choice, 
that will be expressed, and that will be maintained and 
regulated as if it were a normal chromosome. 

20 This cloning or expression vector may take on several 

forms. For example, two principal forms for employment 
are: (1) employed via extrachromosomal /episomal, 
autonomous replication and segregation which could even be 
amplified, and (2) employed via a human artificial 

25 chromosome under normal chromosomal control mechanisms. 

In general and overall scope, the present invention 
relates to the development of recombinant eukaryotic 
cloning and expression vectors based on unique regulatory 

30 elements isolated from autonomously replicating, stable 

episomal units isolated from human tumor cell lines. More 
particularly, these unique regulatory elements include 
origins of DNA replication, and DNA sequences that confer 
extrachromosomal stability and maintenance. These unique 

35 episomal regulatory elements will permit large pieces of 
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DNA to be expressed or cloned (greater than 50 kilobases 
pairs in size) . 

This invention discloses procedures for producing two 
5 different types of vectors. One is a cloning vector and 

the other one is an expression vector* For the purpose of 
this invention, the phrase "cloning vector" refers to a 
DNA vector designed to be used to clone a desired gene* 
The techniques that are involved in cloning vary from 
10 vector to vector and from system to system, however, these 

techniques in general are standard and known to those 
skilled in the art of recombinant DNA technology. 

Also, for the purpose of this invention, the phrase 
15 "expression vector" refers to a DNA vector capable of 

replication in selected mammalian host cells and 
expressing a desired protein. This protein may then be 
recovered from the cells by employing techniques known to 
those skilled in the art. 

20 

This cloning vector should include one or more 
functional origins of DNA replication to permit stable, 
autonomous replication. The phrase "origin of 
replication" is defined as a region that indicates the 
25 origin of replication. 



This cloning vector should include appropriate DNA 
sequences that confer extrachromosomal stability and 
maintenance. The sequences responsible for conferring 

30 extrachrom somal stability and persistence may be related 

to sequences responsible for nuclear matrix attachment 
sites, topoisomerase XI reaction sites, and/or other 
regions required for appropriate interactions with the 
nuclear architecture. This extrachromosomal stability and 

35 maintenance permits the introduction of large exogenous 

genes. 
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This cloning vector should also includ DNA 
selectable marker sequences that can be used to confer 
drug resistance to a transfected cell or DNA sequences 
that can correct a genetic mutation* This allows the 
5 cells that were transfected with the vector to be selected 

for. The DNA selectable marker segment confers upon a 
cell transfected with said vector, the ability to survive 
in the presence of a selected compound or selected group 
of compounds. The compound may be either G418 or 
10 hygromycin B. Also, other selectable marker segments will 

contain DNA encoding an enzyme capable of functionally 
replacing a mutated enzyme so as to render the transfected 
cell resistant to said selected compound or selected group 
of compounds. The enzyme may be selected from a group 
15 consisting of: thymidine kinase, xanthineguanine 

phosphor ibosyl transferase, adenine 
phosphor ibosyl transferase, adenosine deaminase and 
dihydrof olate reductase. 

This cloning vector should also include a multi-use 
multiple cloning site to facilitate recovery for genetic 
modification and analysis and insertion for reintroduction 
into cells for replication and expression. Multiple 
cloning cassette sequence cartridges are commercially 
available from several different companies (Promega, New 
England Biolabs, etc) • A typical cassette sequence would 
include restriction sites for 8-11 different enzymes 
(i.e. Eco RI, Sac 1, Sma 1, Ava I, Bam HI, Xba 1, Hinc II, 
Acc 1, Sal 1, Pst 1, Hind III, etc.) The availability of 
these cassette sequences are known to those skilled in the 
art. 

This cloning vector should also include a DNA segment 
encoding bacterial components necessary for propagation of 
35 said vector in bacteria. Bacterial components that sure 

essential for propagation of the cloning vector in 



20 



25 



30 
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bacteria are known to those skilled In this art. F r 
example, two bacterial components essential for bacterial 
propagation are a replicon that is responsible for 
initiation of replication and antibiotic resistant markers 
5 (i.e. ampicillin, tetracycline, etc.) that permits growth 

in specific antibiotics. 

In addition to the above described five different 
components included in the unique cloning vector, a unique 
10 expression vector capable of expressing large pieces of 

DMA (40 - 400 kb) should also include, a promoter, a 
polyadenylation site and a splice site in special relation 
to allow efficient expression of a structural gene. 

15 The choice of promoters to be included in this vector 

will depend on the mammalian host cell employed. It is 
advantageous to employ a compatible promoter with regard 
to the cells that the desired protein will be expressed 
in. The inventors prefer to employ promoters derived from 

20 the following genes (although other promoters would be 
satisfactory): cytomegalovirus, SV-40, Rous sarcoma 
virus, thymidine kinase, beta -act in, metal lothione in, and 
the epidermal growth factor receptor gene isolated from a 
DiFi episome. 

25 

For the purpose of this invention, a polyadenylation 
site refers to the site at which a poly A tail (a stretch 
of 50 to 300 adenines) is added to the vector for 
efficient expression of a desired protein in a mammalian 
30 cell. Also, the phrase n splice site" refers to a 

bacterial processing site essential to remove introns 
incorporated into the bacterial plasmid. These components 
are essential for optimal expression of a desired protein. 



35 A further embodiment of this invention is an 

artificial chromosome consisting of a DNA segment derived 
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f rom a non-viral epis me, said s gment containing an 
origin for DNA replication, a DNA segment derived from a 
non— viral episome, said segment containing a DNA sequence 
which confers upon said vector the ability to be stably 
5 maintained extrachromosomally in a cell transf ected with 
said vector, a DNA segment containing a multiple cloning 
site, a DNA selectable marker segment conferring upon a 
cell transfected with said vector, the ability to survive 
in the presence of a selected compound or selected group 
10 of compounds, a DNA segment encoding bacterial components 

necessary for propagation of said vector in bacteria, a 
promoter, a polyadenylation site, a splice site, a DNA 
segment encoding a centromere and a DNA segment encoding a 
telomere • 

15 

Further in accordance for this invention is a 
substantially purified non-viral episome of human origin 
capable of stable extrachromosomal maintenance and of 
autonomous replication in a compatible mammalian cell 
20 line. 



Further in accordance for this invention is a 
substantially purified episomal DNA segment containing an 
origin of replication* This invention further includes a 

25 substantially purified' episomal DNA segment containing a 

DNA sequence, which confers upon a vector including said 
segment, the ability to be stably maintained 
extrachromosomally in a cell transfected with said vector. 
Another embodiment of this invention is a substantially 

30 purified, episomal DNA segment containing both an origin of 

replication and a DNA sequence, which confers upon a 
vector including said segment, the ability to be stably 
maintained extrachromosomally in a cell transfected with 
said vector. 



35 



WO 92/07080 PCT/US91/07690 

-11- 



An ther embodiment of this invention is a DNA s gment 
containing an origin for DNA replication is from an 
episome isolated from DiFi colorectal cell line. 

5 Another embodiment of this invention is a DNA 

sequence which confers upon said vector the ability to be 
stably maintained extrachromosomally in a cell transfected 
with said vector is from an episome isolated from DiFi 
colorectal cell line. 

10 

Another embodiment of this invention is a DNA segment 
containing an origin for DNA replication and a DNA 
sequence which confers upon said vector the ability to be 
stably maintained extrachromosomally in a cell transfected 
15 with said vector is from an episome isolated from DiFi 
colorectal cell line. 



The various techniques which have been successfully 
applied to the cloning and expression of many genes in a 
20 variety of host systems, employing many different 

promoters and vectors, are known to those skilled in the 
art of recombinant DNA technology and could be applied to 
the embodiments described herein. 



25 For the purpose of this invention, the phrase 

"operatively spaced with respect to a desired gene" is 
defined as the appropriate positional spacing required 
between the numerous cloning and expression vectors 
components described in this invention so as to allow each 

30 of the of components to achieve its desired function. 

These components are also directionally positioned 5' to 
3 ' . The appropriate spacing needed for efficient cloning 
or expression of a desired gene is determined for each 
individual vector. 



35 



WO 92/07080 



PCT/US91/07690 



-12- 



10 



In terms of transf acting eukaryotic cells with thes 
unique cloning or expression vectors, the transfection 
techniques are standard and known to those skilled in the 
art of recombinant DMA technology. In terms of 
transfecting cells with the unique expression vector, this 
invention could also be applied for the production of 
stable cell lines which are, by definition, continuously 
producing the desired protein. The production of cell 
lines designed to continuously produce the desired protein 
has been described extensively in the literature, and is 
therefore known to those skilled in the art. 



f*TT<* p OF TH E DEPOSITED CELL LINE 

Cell line "DiFi 11 comprising cells obtained from the 
15 ascitic fluid of a colorectal tumor in a patient with 

Gardner's syndrome, is available from the ATCC, accession 
# CRI* 10576. This cell line retains 50 copies or more of 
extrachromosomal episomes, each of which contains at least 
one complete copy of the epidermal growth factor receptor 
20 gene* 

Fig. 1. Tn situ h ybridization of DiFi cells with BGFR. 

A portion of a metaphase from DiFi cells stained with 
Giemsa (A) , fluorescence visualization of in situ 
25 hybridization using biotinylated EGFR as probe and 

counterstained with propidium iodide (B) , and a black and 
white print of the fluorescence pattern of in situ 
hybridization (C) . 

30 Fig. 2. Blectroohoretic m obilization of EGFR genes bv 

qaTmna i rradiation. 
Autoradiogram of a Southern blot of a TAFE gel 
hybridized with 32P- labeled Origin (o) is indicated 

at the top as is the direction of migration. Plug samples 
35 1-8 were exposed to 0, 5, 10, 20, 40 80, 160, 320 Gray, 
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respectively . Hybridization membranes wer exposed to 
film for 24 hrs. 

Fig. 3. Effect of qflp«a irradiatinn nn the 
5 electrophoretic mobilization EGFR in A431. 

DiFi. and HeLa cells. 
Autoradiogram of a Southern blot of a TAFE gel 
hybridized with 32P-labeled EGFR , Origin and direction of 
migration is as in Fig. 2. A431, DiFi and HeLa cell DNA 
10 plugs were irradiated with A. OGy, B. 10 Gy, C. 40 6y, D. 

160 Gy. Autoradiographic exposure was extended to 72 hr 
in order to enhance sensitivity for detecting any 
fragments that might have migrated from the A431 plugs. 

15 Fig. 4. CHEF ana lysis of EGFR in gamma irradiate d DiFi. 

Plugs containing DiFi DNA were exposed to 31.4 Gy 
prior to electrophoresis. The analysis of control (c) and 
irradiated (R) samples was performed in duplicate. 
Approximate sizes of the observed fragments, in kbs, are 

20 indicated to the right. 

INTRODUCTION TO THE DISCLOSED INVENTION; 
AUTONOMOUSLY REPLICATING. STABLY MAINTAINED 
MICROCH ROMOSOMAL UNITS FROM HUMAN TUMOR CELL LINES 

25 In developing the invention, we elected to use stably 

maintained extrachromosomal units arising in some 
eukaryotic cell lines as starting material , because these 
units contain all the genetic regions required for 
autonomous replication and extrachromosomal expression. 

30 Those steps are described below. 

In initial studies, the episomes are isolated from 
the origin in a substantially purified form and the 
minimal essential elements for episomal replication and 
35 transcription are localized and isolated. Those elements 

are then ligated into a selected DNA molecule, together 
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vlth additional DMA 8 gments, including, for example, 
selectable markers, multiple cloning site or sites, 
segments necessary for propagation in bacteria and/or a 
promoter enhancer, splice site and poly adenylat ion site* 

5 

Replication of nuclear DNA in eukaryotes appears to 
be under precise and reproducible control, such that it is 
replicated only once in each S -phase, the DMA synthetic 
portion of each cell division cycle. In addition, each 
10 portion of the genome replicates at the same time in each 

S-phase, with expressing (transcribed) genes replicating 
early and non-expressing and/or structural DMA replicating 
late* 

The genomes of prokaryotes, viruses, and yeast 
contain DMA sequences called origins, that serve as sites 
for initiating cycles of DMA replication. By analogy, 
such sites define replicating units, or replicons, in 
eukaryotic cells such as human cells. 

An accepted working hypothesis is that the eukaryotic 
nucleus is organized into structural domains in which the 
nuclear matrix plays an essential role in organizing 
chromatin structure and in regulating function. Support 
for this hypothesis comes from studies demonstrating that 
DMA replication, DMA repair, transcription and 
post-transcriptional processing are associated with the 
nuclear matrix. Other studies have shown that DMA 
polymerase, RNA polymerase XI, expressing and expressible 
genes, transcriptional enhancer sequences, topoisomerase 
II cleavage sites, topoisomerase II, and heterogeneous 
nuclear RNA (hnRNA) splicing complexes are highly enriched 
or specifically localized in the nuclear matrix. 



15 



20 



25 



30 



35 The fact that regulatory DMA sequences and the 

nuclear proteins with which they interact have not been 
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identified is in part attributabl to th unmanageable 
size of chromosomes and the complexity of the genetic 
elements they contain. However, stable cell lines are 
occasionally established in which regions of specific 
5 genes have been amplified ( Stark f Cancer Surveys, 5:1-23, 
1986) and occasionally are segregated into autonomously 
replicating components. These exist in the nucleus as 
episomes (200 kb - 800 kb molecules) and/or light 
microscope-visible double minute chromosomes (dm ins, >1000 

10 kb) • 

This invention exploits these cell lines by isolating 
and investigating the structure and replication control of 
their extrachromosomal elements in order to identify DNA 

15 sequences required to ensure their autonomy for stable 

maintenance, replication and gene expression. This 
minimal essential structure should then provide the core 
structure with which to assemble a cloning and expression 
vector for genes exceeding sizes accommodated by cosmid 

20 vectors. 



Although the methodology described herein contains 
sufficient detail to enable one skilled in the art to 
practice the present invention, a commercially availbale 

25 technical manual entitled MOLECULAR CLONING (Maniatis, et. 

al. , Cold Spring Harbor Laboratory, Cold Spring Harbor, 
New York) may provide some additional details useful to 
assist practice of some aspects of this invention. 
Accordingly, this manual is incorporated herein by 

30 reference. 



The following examples are designed to illustrate 
certain aspects of the present invention. However, 
they should not be construed as limiting the claims 
35 thereof. 



WO 92/07080 



PCT/US91/07690 

16- 



example 1 

EXTRACHROMOSOMAIi AMPLIFICATION 
OF THE EPIDERMAL GROWTH FACTOR RECSPTOR GEHE 
ZH A HUMAN COLON CARCINOMA CELL LXNE 

5 This example describes the isolation and 

characterization of an autonomously replicating episomal 
unit derived from a human colorectal carcinoma cell, 
established from ascites from a patient with Gardner's 
syndrome, designated "Difi" (Bowman, et al., In: 

10 Hereditary Colorectal Cancer . J. Utsunomiya and H. Lynch 

(Eds.), Springer-Verlag, In Press, 1990). The invention 
is not limited to the M Difi M episome, however, for the 
basic procedures provided by the present disclosure should 
enable those of skill in the art to develop vectors from 

15 the episomes of other cells . 



DiFi cells were (1) successfully established in 
tissue culture, (2) shown to contain amplified EGFR genes 
and mRNA, and (3) characterized cytologically to be near 
20 tetraploid with the presence of double minutes (dmin; 

Bowman et al. In Hereditary Colorectal Cancer . J. 
utsunomiya and H. Lynch (eds) , Springver lag , In Press, 
1990) . 

25 Xa. CELL LINES EMPLOYED AND CELL COT/TORE CONDITIONS 

A431 (obtained from Gary Gal lick, M. D. Anderson 
Cancer Center) , HeLa and DiFi cells were maintained in 
Dulbecco's medium supplemented with 5% fetal and 5% 
newborn calf serum. SW480 cells, a colon tumor cell line 
30 (established by Leibovitz, 1976 and obtained from Mark 

Blick, M. D. Anderson Cancer Center) were grown and 
maintained in L-15 medium containing L-glutamine and 
supplemented with 10% fetal calf serum, insulin (5ug/ml) 
and glutathione (16ug/ml) . 



35 
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A. Characteristic of a human colorectal cancer cell 

line fPiri) 

"DiFi" colorectal carcinoma cell line represents one 
of the first cell lines to be established and 
5 characterized from a patient with Gardner syndrome. 

Malignant ascitic fluid cells were isolated from a 46 
year old female rectal cancer patient with Gardner 
syndrome and initiated to grow in culture. The cells have 

10 been maintained in culture for over three years. Hoechst 
stain analysis for mycoplasma was negative. Subcutaneous 
injection of DiFi cells into athymic mice demonstrated 
tumor production in 50% of the mice. The cells have a 
tetraploid karyotype, and possess an isozyme pattern 

15 characteristic of colorectal cancer cell lines. 



LOCALIZATION OF EGFR DNA TN DiFi CELLS BY IN SITU 
HYBRIDIZATION 

The following studies demonstrated the episomal 
20 location of the amplified EGFR gene. 

Slides containing metaphase cells from either DiFi or 
SW480 cells were prepared and stored at room temperature* 
Prior to in situ hybridization with a biotinylated EGFR 

25 probe, the slides were stained (six minutes in 5% Giemsa 

prepared in phosphate buffer pH 6.8) and photographed. In 
situ hybridization involved treating the photographed 
slides with RNAse, DNA denaturation and dehydration 
solutions, overnight incubation in a hybridization mix 

30 containing a biotinylated EGFR probe, and tagging the 

regions of EGFR hybridization with f luorescein-avidin and 
biotinylated goat anti-avidin. This procedure resulted in 
a three layers of f luorescein-avidin, and visualization by 
fluorescence microscopy (Finkel et al. r Proc . Natl . Acad . 

35 Sci. USA . 83:2934, 1986). 
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The EGFR cDNA probe, HER-A64-3 (Ullrich et al. , 
Nature 309:418-425, 1984), was labeled by nick translation » 
with biot in-7 — dATP according to the instructions provided 
by BRL. Hybridization mix (25 ul) containing 10% PEG 6000 
5 and 5 ng of probe was used on each slide. Following in 
situ hybridization and fluorescence labeling procedures, 
slides were rinsed and counterstained in propidium iodide 
(2 ug/ml in H 2 0) for two minutes, rinsed with H 2 0, and 
carefully blotted dry. Two drops of antifade solution 
10 (Johnson and Aroujo, J. Immunol . Methods 43:349-350, 1981) 

were added to each slide before covering with a cover slip. 
Metaphase chromosomes were photographed under epi-UV- 
illumination on Kodak Ektachrome 160 film using the Zeiss 
filter 25 combination 48 77 09. 

15 

Giemsa— stained metaphase chromosomes from DiFi cells 
revealed a background of extrachromosomal particles at the 
limit of optical resolution (Fig. 1A) • Occasionally, they 
were paired in the form of small dmins. To determine 
20 whether these structures contained copies of the EGFR 
gene, the biotinylated A64-3 cDNA EGFR probe was 
hybridized to these metaphase cells. SW480 cells served 
as a negative control because their dmins are amplified 
for MYC rather than EGFR (Untawale, Masters Thesis on File 

* 

25 at the Graduate School* of Biomedical Sciences, University 

of Texas Health Science Center, Houston, Texas, 1987; 
Untawale and Blick, Anticancer Res. 8:1-8, 1988). 
Thirty-five SW480 metaphase cells were examined for 
hybridization with biotinylated A64-3 cDNA EGFR probe. No 

30 hybridization was observed to any metaphase chromosome or 

extrachromosomal entity (data not shown) . The same 
analysis was performed with DiFi metaphase spreads and 
thirty-three out of sixty-six demonstrated strong 
hybridization to extrachromosomal regions. No conclusions 

35 could be drawn from the remaining thirty-three metaphase 
cells due to weak hybridization or high background. 
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Pigur 1 presents In situ hybridizati n f DiFi 
metaphase cells with EGFR probe. A portion of a metaphase 
spread from DiFi cells was stained with Giemsa (1A) . 
Fluorescence visualization of in situ hybridization using 
5 biotinylated egfr as a probe and counterstained with 

propidium iodide is shown in IB, and a blade and white 
print of the fluorescence pattern of in situ hybridization 
is shown in 1C. 

10 In the Geimsa stained metaphase (1A) the chromosomes 

are intensely stained in contrast to the diffuse staining 
of extrachromosomal material in the background. The 
extrachromosomal background appears to be dmin, which vary 
in their size and visibility. Hybridization of the 

15 biotinylated EGFR probe (yellow fluorescence) was limited 

to extrachromosomal regions containing dmin, rather than 
chromosomal DNA (IB) • In ord^r to emphasize the 
extrachromosomal hybridization the photograph was printed 
in black and white (1C) . In Figure 1C, the 

20 extrachromosomal labeling was visualized more clearly 

since the fluorescein fluorescence is more intense in dmin 
than isothe propidium fluorescence from the chromosomes. 

Therefore, in situ hybridization of the biotinylated 
25 EGFR probe in the DiFi cell line demonstrated localized 

hybridization predominantly in extrachromosomal regions 
rather than to chromosomal DNA. 

The in situ hybridization analysis presented in 
30 Figure IB and 1C consistently demonstrated specific 

biotinylated EGFR localized in the extrachromosomal 
background. This specific localization is most likely 
associated with episomes many of which are too small in 
size and disorganized in structure to be visualized as 
35 dmins in standard cytogenetic spreads. 
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TTT. PREPARATION AND IRRADIATION OF DNA 

After confirming that the EGFR amplification observed 
in the DiFi cells was mediated by a stable episomal 
fraction, we next sought to isolate that fraction from the 
5 cells using the procedures described below. 

Cells were embedded/ lysed and deproteinized in 
agarose blocks in order to minimize shear damage to the 
DNA (Smith et al., In Methods in Enzvmoloav. M. Gottesman 

10 (Ed.), Academic Press, San Deigo, Vol 151, p. 461., 1987). 

Agarose blocks, with each sample containing approximately 
3 ug of DNA, were cut to fit gel slots. Samples were 
suspended in 1 ml of TAFE buffer (10 mM Tris-acetate, pH 
8.0; 0.5 mM EDTA) in 12 x 75 mm polystyrene culture tubes 

15 and exposed to m Cs gamma rays at a dose-rate of 45 

Gray/min to linearize the DNA for pulse field 
electrophoresis (van der Blick et al. r NAR 16:4841-4851, 
1988; Beverly, NAR 16:925-939, 1988; Ruiz et al., Mol. 
Cell. Biol. 98: 109-115 , 1989). The inventors exposed 

20 agarose plugs containing unsheared DiFi cellular DNA to 

varying doses of gamma radiation prior to analysis by 
pulse-field gel electrophoresis. Appropriate levels of 
exposure were estimated based on an expected yield of 1.1 
x 10** double-strand breaks/Gy/bp (calculated from Krisch 

25 et al., Rad. Res. 101:356-372, 1985). 

IV. PULSED— FIELD GEL ELE CTROPHORESIS WAS EMPLOYED TO SIZE 
DNA 

Following irradiation, the samples were loaded into 
1% agarose gels and subjected to transverse alternating 
field electrophoresis (TAFE) using TAFE buffer in a 
GeneLine system (Beckman Instruments) • Agarose plugs 
containing yeast chromosomes or concatemers of lambda 
phage DNA were included on gels as size standards. 
Initial current was held constant at 170 ma for 30 min, 
reorienting the direction of the electrical field every 4 
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s c, follow d by a constant current of 150 mA f r 18 hr 
with a field reorientation interval of 60 sec. 

some experiments employed the clamped homogeneous 
5 electrical field (CHEF) protocol for pulsed-field gel 
electrophoresis (Chu et al., SS1SDS& 232:65-68, 1986). 
Here, electrophoresis was performed in 0.5x TOE buffer (45 
mM boric acid, 45 mM Tris and 2 mM EDTA, pH 8.3) at a 
constant current of 70 volts reoriented every 15 ain for a 
10 total of 3 days. 

Upon completion of electrophoresis, staining (0.5 
ug/ml ethidium bromide) , and photography, gels were 

15 irradiated for 5 min with 254 nm UVL (Gelman Instrument 

Co., Model 51438). This was followed by gentle shaking in 
0.25 M HC1 for 5 min for depurination, rinsing in 
deionized water, soaking in 0.4 M NaOH for 1 hr for 
hydrolysis of depurinated bases, rinsing in deionized 

20 water, and soaking in 0.2 M NaOH, 0.6 M NaCl for 1 hr for 

denaturation. The DMA was transferred to a Zetabind nylon 
membrane (AMF Cuno, Inc.) in the denaturing solution for 
15-20 hrs. The filter was then treated with two 15 min 
washes in a neutralizing solution (0.5 M Tris-HCl, pH 7.5; 

25 1.5 M NaCl) and dried in a vacuum oven at 80°C for 1 hr. 

Labeling of probe, hybridization to filters and 
autoradiography for visualization of fragments were 
performed as previously described (Amasino, Anal, Biochem. 
152:304-307, 1986; Liu et al. , Science 246:813-815, 1989). 



30 



Figure 2, an autoradiogram of a Southern blot of a 
TAFE gel probed with 32P-labeled ££EE, demonstrates 
electrophoretic mobilization of ES£E genes by gamma 
irradiation. The origin (o) as well as the direction of 
35 migration is indicated at the top of the figure. Plug 

samples 1-8 were exposed to 0, 5, 10, 20, 40 80, 160, 320 
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Gy, respectively* Hybridization membranes vere exposed to 
film for 24 hrs. 

Southern analysis of a gel hybridized with an EGFR 
5 probe demonstrated the dose dependent migration of two 

different sized fragments containing EGFR sequences (Fig. 
2) . The pattern of migration of total DNA was observed by 
staining gels with ethidium bromide (data not shown) . 
Dose-dependent increases were observed in the amount of 

10 random sized DNA fragments migrating between the sample 

well and the front of each lane. Increased amounts of DNA 
also accumulated in the zone representing molecules of 
2500 kb or larger under the electrophoresis conditions 
employed. The EGFR -containincr fragments migrated at a 

15 position consistent with approximately 650 kb and 1300 kb 
representing faster and slower migrating forms, 
respectively. The origin is indicated by "O* 91 

Figure 3, an autoradiogram of a Southern blot of a 
20 TAFE gel probed with 32P- labeled EGFR . demonstrates the 
effect gamma irradiation has on the electrophoretic 
patterns of migration of EGFR sequences in A431, DiFi, and 
HeLa cells. The origin and direction of migration are as 
in Fig. 2. DNA plugs from A431, DiFi and HeLa cells were 
25 irradiated with increasing amounts of radiation: Lane 

(A) : OGy; Lane (B) : 10 Gy; Lane (C) : 40 Gy; Lane (D) : 160 
Gy. The autoradiographic exposure was extended to 72 hr 
in order to enhance sensitivity for detecting any 
fragments that might have migrated form the A431 plugs. 

30 

Dose dependent increases were observed in the amounts 
of randomly broken DNA fragments migrating from sample 
wells into each lane. As is observed, EGFR amplification 
35 is much higher in DiFi DNA and A431 DNA when compared to 

HeLa DNA. More importantly, sample plug irradiation did 
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not: release discrete sizes of KeLa and A431 EGFR sequences 
were (confirmed by exposing autoradiograms for 7 days, 
data not shown) . However, mobilization of both the 650 kb 
band and 1300 kb band DiFi EGFR fragments were readily 
detected. To summarize, EGFR sequences in both HeLa and 
A431 DNA appear to be chromosomal ly localized. In 
contrast, EGFR sequences in DiFi DNA appear to be 
episomally (extrachromosomally) localized and may be 
substantially purified by the procedure described here. 



Figure 4 presents CHEF analysis of EGFR from gamma 
irradiated DiFi DNA. Plugs containing DiFi DNA were 
exposed to 31.4 Gy prior to electrophoresis. The analysis 
of control (c) and irradiated (R) samples was performed in 

15 duplicate. Approximate sizes of the observed fragments, 
in kbs, are indicated to the right* Irradiating DiFi 
plugs and conducting CHEF electrophoresis under conditions 
that resolve larger DNA fragments revealed the presence of 
a weakly hybridizing band of approximately 2,000 kb, in 

20 addition to the 650 kb and 1300 kb fragments (Fig. 4) . In 
unirradiated control lanes (C) a small portion of 
£g£B-containing molecules were observed to have migrated 
into the gels. This observation was previously attributed 
to degradation of cellular DNA during the preparation of 
25 agarose plugs (van der Blick, et al., NAR 16:4841-4851, 

1988} . 

VI. SUMMARY 

In situ hybridization, using a biotinylated cDNA 
30 probe for the epidermal growth factor receptor ( EGFR ) 

gene, demonstrated that amplified EGFR in colon tumor cell 
lines, DiFi, is localized to many small double minute 
chromosomes of varying size and visibility. Analysis of 
the electrophoretic mobility of gammairradiated DNA from 
35 DiFi by pulsed-field gel electrophoresis and Southern blot 

hybridization using EGFR probe, indicated that the 
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amplifi d EGFR in DiFi exists in extrachromosomal , 
covalently-closed circular episomes, probably equivalent: 
to dmin. Two major and one minor species were observed 
having estimated sizes of 650 kb, 1300 kb, and 2000 kb. 
5 The DiFi cell line appears to represent a unique case of 
extrachromosomal EGFR gene amplification in human cells. 
DiFi represents the first example of a stably maintained 
episome in which EGFR is amplified* 



10 gaEMPPftg 9 

CONSTRUCTING A MAMMALIAN EPISOMAL 
EXPRESSION OR CLONING VECTOR 

The identification, characterization and isolation of 
DNA regulatory regions within the episomes that function 

15 a) as origins of autonomous DNA replication, and b) 

function as stabilizing regions for extrachromosomal 
maintenance will permit the construction of cloning and 
expression vectors that replicate and function as 
extrachromosomal vectors. The following is meant to serve 

20 as one example of identifying and isolating such 

regulatory factors from the episomal unit maintained in 
human tumor cell. In some instances, reference is made to 
working with the episomal unit from DiFi cells; DiFi is 
used here only as an example. 

25 

It. IDENTIFICATION AND ISOLATION OF REGULATORY ELEMENTS 
IN STABLE EPISOMAL UNITS ESTABLISHED IN HUMAN TUMOR 
CELL LINES 

A» Episome Isolation 
30 In order to identify and isolate replication 

regulatory elements from an episome, the episome itself 
must first be isolated. 



35 



The ideal starting point is a preparation that is 
highly enriched for the episomes of interest. A highly 
enriched source of fiGER-containing episomes is the human 
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DiFi c 11 line. DNA will b isolat d from this enriched 
preparation and most of the DiFi genomic DNA can be 
eliminated from this preparation by employing an alkaline 
lysis modification (Griffin, et al«, J. Virol. . 40:11-19/ 
5 1981) • An essentially pure preparation of DiFi episomes 
can then be obtained by preparative electrophoresis on 
agarose gels that permits the mobilization of covalent 
circular DNA molecules (Carroll et al., Mol. Cell* Biol. . 
7:1740-1740 (1987)). These molecules can then be 
10 recovered from the gels by procedures that dissolve or 

digest (agarose) the agarose and permit the episomal DNA 
to be purified directly from the digest. 



Determine a Restriction Mao of the Episomal 

15 ggflQroe t 

A restriction enzyme analysis will be performed after 
the episome is isolated. For example, most of the DiFi 
episome can be separated into two pieces by exploiting the 
limited number of sites susceptible to restriction enzymes 

20 Mlul (2 sites) and NotX (2 sites) . Hlul cuts at two 
closely spaced sites whereas NotX cuts at two widely 
distant sites. Table 1 presents macrorestriction fragment 
sizes of DiFi episomes digested with Hlul and NotX 
restriction enzyme. 

25 

TABLE 1 

MACRORESTRICTION FRAGMENT SIZES OF EPISOMES 

DIGESTED WITH Mlul AND Not I RESTRICTION ENZYME 

Restriction Enzyme Fragment Sizes 

30 Mull - 50 kb, - 600 kb 

NOtI - 270* kb f - 380** kb 

Mlul + NotI -50 kb, -220 kb, -380 kb 

* The 3' end of this fragment contains the 5' 
35 untranslated region, exon I, and the 5' end of 

intron X of the EGFR gene. 
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** The 5 7 end of -this fragment: contains the 

remainder of the EGFR gene from intron I through 
the 3' terminus of the gene, 

5 

Digestion of total DiFi DMA with Mlul and 
electrophoresis on agarose gels using a pulsed field gel 
electrophoresis format (Chu et al.. Science 232:65*68, 
1986) permits isolation of the region in the gel 

10 containing DNA fragments of -600 kb. Digesting the 

agarose plugs with NotI further reduces the size 
distribution pertaining to genomic DNA and also cleaves 
the DiFi episome into its expected fragments. This 
protocol yields identifiable and highly enriched DiFi 

15 episomal fragments on a background of digested genomic 

DNA. The individual episomal NotI fragments (-220 and 
-380 kb) are concentrated by electrophoresis in a second 
dimension, and then recovered from the gel by procedures 
that dissolve or digest agarose, thereby allowing 

20 purification of the desired DNA fragments for cloning. 

C. Construction of DiFi Episome Recombinant DNA 
frjfrrarjegs 

1. Lambda Libraries 

25 Lambda libraries were constructed that represented 2 

to 10 kb portions of the DiFi episome by utilizing 
partially restriction enzyme digested episomes or NotI 
fragments and the Lambda-Zap phagemid vector (Short, 
Fernandez, Sorge, and Huse, Nuc. Acids Res ,. 16:7583-7600, 

30 1988). 

2. Cosmid Libraries 

Cosmid libraries are constructed with BamHI partial 
digests of isolated episomes or NotI DiFi episomal 
35 fragments by utilizing the sCosl vector (Evans, et al, 

Gene . 79:9-20, 1989). These cosmid libraries represent 
portions of the DiFi episome in approximately 40 kb 
blocks. 
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3. Pi Libraries 

Recombinant DNA libraries containing portions of the 
DiFi episome are constructed by utilizing the PI 
bacteriophage based cloning vector (Sternberg, Proc. Nat. 
5 Acad, sci. USA , 87:103-107, 1990). This Pi library 

contains DiFi episomal portions representing two size 
ranges: less than 30 Kb and approximately 85 - 110 kb. 

4. Plasmid libraries 

10 Recombinant DNA libraries containing portions of the 

DiFi episome are constructed utilizing an E. coli F sex 
factor based cloning vector (Leonardo and Sedivy, 
Biotechnology . 8:841, 1990). This F plasmid library 
contains DiFi episomal portions up to at least 150 Kb. It 

15 should be understood that other plasmid libraries can be 

constructed using one of several available plasmid vectors 
(i.e. pKS/ pT7\T3a-18, etc.). These vectors are known to 
those skilled in this art. 

20 SU. Identification of Functional Regions Within 

ppisomes Regulating DNA Replication 
In order to identify distinct episomal regions for 
replication, various portions of recombinant DiFi episomal 
DNA libraries (from the above section) are first 

25 introduced into appropriate mammalian host cells (Krysan, 

et. al., Mol. Cell. Biol. . 9(3):1026, 1989). Autonomously 
replicating segments from the DiFi episome are first 
identified and the isolated segment is incorporated into a 
cloning or expression vector. Any transfection method may 

30 be employed for introducing portions of the recombinant 

library into mammalian host cells (i.e. calcium phosphate 
transfection (Chen and Okayama, Molec. Cell. Biol. 
7:2745*2752 (1987)); electroporation (Chu et al., Nucl 
Acids Res. 15:1311-1326 (1987)). 

35 
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For example, p ols of approximately 10 different 
plasmid vector clones from the DiFi Cosl library are 
introduced into for example, HSF56 human primary 
fibroblast cells via calcium phosphate transf action or 
5 electroporation. Each Cosl vector clone contains a 

selectable marker that confers drug resistance to 6418, 
for example. Retention and replication of transf ected 
clones are identified by growing the transfected 
population of HSF56 cells in the presence of 64X8, a 

10 compound which specifically selects for cells that are 
neomycin resistant. The cells are placed under 6418 
selection 2 days after transf ection, and 6418 resistant 
populations are grown for at least two months by 
maintaining the resistant clones appropriate subculturing 

15 techniques known to those skilled in the art of tissue 



Neomycin resistant clones that persist for several 
cell divisions therefore contain a DiFi Cosl vector clone 

20 that is replicating. A persistent neomycin resistant cell 

clone is recovered and low molecular weight DNA (less than 
120 kb) is isolated by the HIRT extraction method (Hirt, 
J- Mol. Biol. . 26:265-369, 1967). The DNA isolated from 
this neomycin resistant cell clone will be subcloned into 

25 plasmid vectors that accommodate smaller inserts, such as 

the pKS vector or the pT7/T3a-18 vector, which, 
preferably, will also contain a selectable marker, such as 
a gene encoding beta lactamase, which confers resistance 
to ampicillin. 

30 

The result of this will be another plasmid library 
which includes specific regions, one or more of which 
contain an origin for DNA replication. The clones from 
this new library will next be introduced into bacteria and 
35 bacterial colonies resistant to, for example, ampicillin, 

will be isolated. In a preferred embodiment, the host is 
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an E. coli cell of a type which is compatible with the 
vector type. 

To determine if the DNA in the bacterial colonies 
5 contain an origin of DNA replication, the DNA from the 
ampicillin resistant bacterial colonies will be 
trans fee ted into a mammalian cell line. The DNA (isolated 
with the HIRT extraction method) from the transfected 
mammalian cells will be analyzed by the Dpn I digestion 
10 (Krysan et al., Molec. C ell. Biol, 9:1026-1033, 1989 which 

is incorporated herein by reference) . DNA exhibiting the 
bacterial methylation pattern is cleavable by Dpn X 
restriction enzyme while DNA with mammalian methylation 
pattern is not. Thus, DNA that is not digested by Dpn 1 
15 has replicated in the mammalian cell. The origins for DNA 
replication will then be identified within the inserts in 
autonomously replicating clones. The origin can then be 
removed from the vector, and inserted into the recombinant 
cloning vector. Vectors that include regions from the 
20 DiFi episome are designated pDFE ori + and will serve as 

the recipients for inclusion of other regions of the DiFi 
episome conferring episome maintenance. 

Identification of Function al Regions Within 
25 Epjsomes Regulating Extrachromoso mal Maintenance 

Identifying those individual clones that contain a 
region conferring extrachromosomal stability is determined 
by long term culturing (longer than two months) in the 
presence of a selection drug. The clones that survive the 
30 continuous exposure to the selection drug must contain a 
region that confers extrachromosomal stability. 

Briefly, clones that persist during several cell 
division cycles will also be evaluated to identify regions 
35 within episomal DNA that confer stability for maintenance 
of extrachromosomal molecules. The procedure by which 
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10 



is lati n of this region is essentially the same one as 
described for identifying the replication region, except 
that vectors containing DiFi episomal origins of 
replication will be used to clone other restriction 
fragments from the DiFi episome. Once the first round of 
drug resistant cell colonies are identified, the episomal 
DNA may be isolated and introduced into bacteria and 
bacterial colonies resistant to, for example, ampicillin, 
will be isolated. 



To determine if the DNA in the bacterial colonies 
contain a region conferring extrachromosomal stability, 
the DNA from the ampicillin resistant bacterial colonies 
vill be transfected into a mammalian cell line. The DNA 
15 (isolated with the HXRT extraction method) from the 

transfected mammalian cells will be analyzed for fragment 
size and, depending on that size, another cycle may be 
initiated to further reduce the size of the piece of DNA 
that confers the extrachromosomal stability. 

20 

In addition to evidence for extrachromosomal 
stability that is provided by the vector's provision of 
drug resistance, the intranuclear localization of vector 
episomes will be evaluated. Vector-containing cells are 

25 treated with the non-ionic detergent Triton X-100 and 2N 
NaCl. This treatment produces salt extracted residual 
nuclei, called nucleoids, which can be centrifuged into a 
pellet at low speeds. Vectors associated with the nuclear 
matrix will pellet with the nucleoids; if they do not 

30 pellet with the nucleoids they will remain in the 

extracts' supernate. 

Both the region for the origin of DNA replication and 
for extrachromosomal maintenance will be defined as the 
35 core structures of both the cloning and expression vector 

and vill be designated PDFE ori + mat***. 
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Z-l. Construction of Optimal Eukarvotic Clonincr 

Vector to Accommodate 40 kb - 40Q kb Pieces of 
DNA- 

Once the core structure is determined, construction 
5 of the optimal eukaryotic cloning or expression vector 
will be completed. This is accomplished by adding the 
following three features to the core structure (these will 
be discussed below) : 

10 a. a DNA or genomic DNA region encoding at least 

one selectable marker; 

b. a DNA or genomic DNA region encoding a multiple 
cloning site; and 

15 

c. a DNA or genomic DNA region encoding bacterial 
components necessary for propagation of the 
vector in bacteria. 

Selectable markers, for mammalian cells, confer 
resistance to a specific selection agent once DNA 
conferring the resistance is transfected into individual 
cells possessing a genetic inheritance pattern appropriate 
for the selectable marker being used in the vector. There 
are a variety of different dominant and recessive 
selection agents known to those skilled in the art. Any 
one of the following genes and agents should be effective 
in terms of employing a selection system: 

□ G418 resistance is selected by exposure to 

medium containing 100 to 800 ug/ml G418. 6418 
selects for cells deficient in the enzyme 
aminoglycoside phosphotransferase and are 
referred to as neomycin resistant cells. 
(Southern and Berg, J. Molec. AppI. Gen.. 



20 
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1:327-341, 1982; C lber -Garapin et al., i-. 
Molec- Biol.. 150:1, 1981). 

HAT resistance for forward selection 
(converting a thymidine kinase minus cell to a 
thymidine kinase positive cell) is selected with 
complete medium supplemented with 100 uM 
hypoxanthine, 0.4 uM aminopterin, 16 uM 
thymidine and 3 uM glycine. HAT medium selects 
for variants defective in either 
hypoxanthine-guanine phosphor ibosyl -transferase 
or thymidine kinase (Littlef ield, Proc. Natl. 
Acad. Sci. USA . 50:568, 1963; Littlef ield, 
Science . 145:709-710, 1964). 

Hygromycin B resistance is selected by 
exposure to complete medium supplemented with 10 
- 400 ug/ml hygromycin B. Hygromycin B selects 
for variants defective in the enzyme 
hygromycin-B-phosphotransf erase (Gritz and 
Davies, Gene , 25:179-188, 1983; Santerre, et 
al., Gene , 30:147, 1984; Palmer, et.al., Proc. 
Natl. Acad. Sci. USA . 84:1055-1059, 1987). 

Adenine phosphor ibosyltransf erase (AFRT) 
positive variants are selected by exposure to 
medium supplemented with 25 uM alanosine, 50 uM 
azaserine and 100 uH adenine (Lovy, et. al., 
Cell , 22:817, 1980; Adair, et. al. , Proc. Natl. 
Acad. Sci. USA , 86:4574-4578, 1989). 

Xanthine - Guanine 
Phosphoribosyltransf erase (XGPRT) positive 
variants are selected with complete medium 
supplemented with dialyzed fetal calf serum, 250 
ug/ml xanthine, 15 ug/ml hypoxanthine, 10 ug/ml 



WO 92/07080 PCT/US91/07690 



-33- 



thymidin , 2 ug/ml aminopterin, 25 ug/ml 
mycophenolic acid, and X50 ug/ml L-glutamine 
(Mulligan and Berg, Proc. Natl. Acad. Sei. USA. 
78:2072-2076, 1981). 

5 

□ Methotrexate resistance is selected by 

exposure to complete medium supplemented with 
0.01 uM - 300 uM methotrexate and dialyzed fetal 
calf serum. Methotrexate selects for cells 
10 expressing high levels of dihydrof olate 

reductase (O'Hare, et al., Proc. Natl. Acad. 
Sci. USA . 78:1527, 1981; Simonsen and Levinson, 
Pro. Natl . Acad- Sci. USA. 80:2495-2499, 1983). 



15 □ Deoxycof ormycin resistant cells are 

selected by exposure to complete medium 
supplemented with 10 ug/ml thymidine, 15 ug/ml 
hypoxanthine, 4 uM 9-B-D- xylof uranosyl adenine 
(XylA) , and 0.01 - 0.03 uM 2 ' -deoxycof ormycin 

20 (dCF) . This selection selects for mutants 

expressing adenosine deaminase (ADA; Kaufman, 
et. al., Proc. Natl. Acad. Sci. USA. 
83:3136-3140, 1986). 



25 For added ease in handling and manipulating, this 

optimum eukaryotic cloning vector could include a DNA 
region comprising a multiple cloning cassette sequence 
containing infrequent cutting by restriction enzymes to 
facilitate the insertion of a desired gene. Multiple 

30 cloning cassette sequence cartridges are commercially 

available from several different companies (Stratagene, 
Promega, New England Biolabs etc) • A typical cassette 
sequence cartridge would inc.. ade restriction sites for 8 - 
11 different enzymes (i.e. Eco Rl, sacl, sma l, Ava l, Bam 

35 HI, Xba 1, Hinc II, Acc 1, Sal 1, Pst l, Hind III, etc.). 
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The availability of these cassette cartridges are known to 
those skilled in the art. 

The bacterial plasmid sequences may be derived from 
5 any one of the many different vectors that are 

commercially available and known to those skilled in the 
art of recombinant DNA technology. For the purpose of 
this invention, pUC, pKS, pBR322 and pT7/T3al8 are used as 
a matter of preference, however, other vectors would be 
10 equally effective. For example, if pBR322 sequences are 

introduced into the cloning or expression vector, the 
resulting recombinant can then be shuttled back and forth 
between E. coli and mammalian cells. 

The construction of an optimal eukaryotic expression 
vector that can accommodate 40 kb - 400 kb pieces of DNA 
will also contain, in addition to the elements described 
for the cloning vector, a DNA region containing a 
promoter, a polyadenylation and splice site necessary for 
the expression of the desired gene. 

There are at least two approaches for constructing an 
optimal eukaryotic cloning vector that can accommodate 40 
- 400 kb pieces of DNA. 
25 

1. The first and more simpler approach is to begin 
with a readily available cloning plasmid vector 
capable of propagation in bacteria. There are 
many different vectors known to those skilled in 

30 the art that would work efficienty. Several 

different components and features can easily be 
ligated into this bacterial plasmid vector. 
These added features are discussed below. Once 
completed, the vector will not only have the 

35 core structure (to confer the ability to 

replicate DNA and to be maintained 



15 



20 
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xtrachromosomally) but will also have the added 
features to optimize the vector for propagation 
in bacteria and for identification of its 
presence after transfection into a mammalian 
5 cell recipient. 

2. The second approach involves custom designing 
andcreating the optimum cloning vector by 
ligating all the desired features and components 
10 (including the core structure) together to 

generate the vector of choice. 

Sul. Construction of a Mammalian Artificial C hromosome 

The episomally maintained and replicated vector pOFE 
15 ori + mat* is introduced into cells and persist as covalent 

circular extrachromosomal molecules* In this form the 
episomes accumulate to produce multiple copies in each 
cell and accordingly, also overproduce mRNA and its 
protein product. While this is desirable for producing 
20 amplified genes and gene products, the introduction of 

cloned genes into cells for use in gene therapy requires 
the control of gene copy number and attendant gene 
expression. Such control is introduced into the DiFi 
episome vector by introducing DNA sequences that stabilize 
25 artificial chromosomes containing linear double stranded 

DNA (DNA encoding a telomere) • Such sequences occur at 
the termini of natural chromosomes; in human chromosomes 
5 ' — AGGGTT— 3 9 is tandemly repeated to the extent of 10 of 
15 kb at every telomere (Blackburn, Science r 249:489, 
30 1990) . This tandemly repeated sequence is ligated to each 

end of a linearized cloning and expression vector to 
stabilize the termini. The addition of telomere sequences 
specific for other species provides for the stabilization 
of artificial chromosomes when introduced into those 

35 
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Centromere sequences ar kn wn to identify regions 
within chromosomes where kinetichores are organized and 
mitotic spindles are attached to the chromosomes, thus 
ensuring for the segregation of chromosomes during 
5 mitosis. DNA sequences that serve as centromeres are 
introduced into an internal region of the linearized 
cloning and expression vector which contain telomeres 
resulting in an artificial chromosome. This synthetic 
chromosome contains required regulatory and stabilizing 
10 DNA sequences that normally occur in natural chromosomes. 

Specific genetic function is conferred on this 
synthetic chromosome by ligating a gene of interest into 
its multiple cloning site. For example, the gene or cDNA 

15 derivative of the gene that is defective in Duchenne's 

muscular dystrophy or myotonic dystrophy, or one of a 
number of other diseases associated with muscle 
dysfunction may be cloned into the artificial chromosome. 
The artificial chromosome is then introduced into cells or 

20 tissues or animals by methods appropriate for the target. 

The transfected chromosome is established as an integral 
component of the recipient cells where it is stably 
maintained and expressed. Recipient cells, tissues or 
animals that were initially dysfunctional because of a 

25 genetic defect they possessed are cured and become normal 
because of the expression and synthesis of the normal gene 
product introduced in the artificial chromosome. 

Hi Evaluation of Different Strategies for 
30 Transfectina Cloning or Expressio n Vectors Into 

M«W al 7 an Cells 
Once the optimal cloning and expression vector is 
constructed, several different strategies for trans feet ing 
the vectors will be studied. Examples of potential 
35 methods includes: (l) encapsulation of insert-containing 
vectors in liposomes of appropriate composition to enhance 
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entry into target cells, and (2) electroporati n of vector 
into mitotic cell recipients to enhance its inclusion 
within the nucleus as cells progress into Gl phase of the 
cell cycle, and (3) injection of DNA-encoated particles 
into cells by employing a Biolistic Particle Delivery 
System (DuPont) . This procedure essentially shoots 
DNA-coated bullets into cells or tissues. 



Biosynth etic Production of Proteins in Cells 
10 Transf ect ed With Cloning and Expression Vectors 

containing Isolated Genes or Functional 
Derivatives 

Medically important proteins are produced in 
mammalian cells that have been transfected with the vector 

15 containing the gene encoding the protein. Since the 

gene-containing vector accumulates in the transfected 
cells, the amount of protein produced increases as more 
vector copies accumulate. The following example 
illustrates an efficient system for protein production. 

20 To produce the product of the gene that is deficient in 
patients with myotonic dystrophy, the vector containing 
the normal gene is electroporated into a normal primary 
human fibroblast cell line HSF56, adapted for growth in 
suspension culture in serum free medium. The accumulation 

25 of the cloning vector in each cell is accelerated by 

growing the cells in the drug appropriate for the drug 
resistance gene contained in the vector. As the gene copy 
number accumulates the amount of protein increases to be 
recovered from the culture medium or from the cells after 

30 maximal growth is achieved. The medical condition of 
patients with myotonic dystrophy may be improved by 
treatment with the protein that is provided by this 
cloningexpression system. Modification of the vector to 
include other genes and selection of target cells and 

35 appropriate culture conditions provides endless possible 
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systems for the production and isolation of mammalian 
proteins - 

The foregoing description has been directed to 
5 particular embodiments of the invention in accordance with 
the requirements of the Patent Statutes for the purposes 
of illustration and explanation. It will be apparent, 
however, to those skilled in this art, that many 
modifications and changes in the apparatus and procedure 
10 set forth will be possible without departing from the 

scope and spirit of the invention. It is intended that 
the following claims be interpreted to embrace all such 
modifications and changes. 
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CLAIMS 

1. A composition of matter comprising a 
substantially purified non-viral episome of human origin 
capable of stable extrachromosomal maintenance and of 

* 

autonomous replication in a compatible mammalian cell 
line. 



10 2. A substantially purified episomal DNA segment 

containing an origin of replication. 



3. A substantially purified episomal DNA segment 
15 containing a DNA sequence which confers upon a vector 

i eluding said segment the ability to be stably maintained 
extrachromosomal ly in a cell transfected with said vector. 



20 4. A substantially purified episomal DNA segment 

containing an origin of replication and a DNA sequence 
which confers upon a vector including said segment the 
ability to be stably maintained extrachromosomally in a 
cell transfected with said vector* 



5. The substantially purified episomal DNA segment 
of claim 2, 3, or 4 wherein the episomal DNA segment is 
from an episome isolated from DiFi colorectal cell line. 



6. A cloning vector comprising the following 
components operatively spaced with respect to a desired 
gene: 



35 
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a) a DNA segment derived from a non-viral episome, said 
segment containing an origin for DNA replication; 

b) a DNA segment derived from a non-viral episome, said 
5 segment containing a DNA sequence which confers upon 

said vector the ability to be stably maintained 
extrachroxnosomally in a cell transfected with said 
vector; 

10 c) a DNA segment containing a multiple cloning site; 

d) a DNA selectable marker segment conferring upon a 
cell transfected with said vector the ability to 
survive in the presence of a selected compound or 

15 selected group of compounds; and 

e) a DNA segment encoding bacterial components necessary 
for propagation of said vector in bacteria. 



20 



25 



7. The cloning vector of claim 6 wherein said 
compound is selected from the group consisting of 6418 and 
hygromycin B. 



8. The cloning vector of claim 6 further including 
a DNA sequence encoding a desired protein. 



30 9. The cloning vector of claim 6 wherein the 

segment containing the origin for DNA replication is from 
an episome isolated from DiFi colorectal cell line. 

35 10. The cloning vector of claim 6 wherein the 

segment containing a DNA sequence which confers upon said 
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vect r the ability t be stably maintained 
extrachromosomally in a cell transfected with said vector 
is from an episome isolated from DiFi colorectal cell 
line. 



11. The cloning vector of claim 6 wherein the 
chromosomal DNA of said transfected cell contains a 
mutation in an enzyme, said mutation rendering said 

10 selected compound or selected group of compounds toxic to 

said cell when said selectable marker segment is not 
present in said cell and wherein said selectable marker 
segment contains DNA encoding an enzyme capable of 
functionally replacing said mutated enzyme so as to render 

15 said transfected cell resistant to said selected compound 

or selected group of compounds. 



12. The cloning vector of clc.im n wherein said 

20 enzyme is selected from the group consisting of: thymidine 

kinase, xanthine-guanine phosphor ibosyl transferase, 
adenine phosphor ibosyl transf erase, adenosine deaminase and 
dihydrofolate reductase. 

25 

13. A cloning vector comprising the following 
components operatively spaced with respect to a desired 
gene : 

30 a) a DNA segment derived from a non-viral episome, 

said segment containing an origin for DNA 
replication and a DNA sequence which confers 
upon said vector the ability to be stably 
maintained extrachromosomally in a cell 

35 transfected with said vector; 
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b) a DNA segment containing a multiple cloning 



c) a DNA segment conferring upon a cell transf ected 
5 with said vector the ability to survive in the 

presence of a selected compound or selected 
group of compounds; and 

d) a DNA segment encoding bacterial components 
10 necessary for propagation of said vector in 



14. The cloning vector of claim 13 wherein said 
15 compound is selected from the group consisting of G418 and 

hygromycin B. 



15. The cloning vector of claim 13 further including 
20 a DNA sequence encoding a desired protein. 



16. The cloning vector of claim 13 wherein the DNA 
segment containing an origin for DNA replication and a DNA 

25 sequence which confers upon said vector the ability to be 

stably maintained extrachromosomally in a cell transfected 
with said vector is from an episome isolated from DiFi 
colorectal cell line. 

30 

17. The cloning vector of claim 13 wherein the 
chromosomal DNA of said transfected cell contains a 
mutation in an enzyme, said mutation rendering said 
selected compound or selected group of compounds toxic to 

35 said cell when said selectable marker segment is not 

present in said cell and wherein said selectable marker 
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segment contains DNA encoding an nzyme capabl of 
functionally replacing said mutated enzyme so as to render 
said transf ected cell resistant to said selected compound 
or selected group of compounds. 



18* The cloning vector of claim 17 wherein said 
enzyme is selected from the group consisting of: thymidine 
kinase, xanthine-guanine phosphor ibosyl transferase, 
10 adenine phosphor ibosyltransf erase, adenosine deaminase and 
dihydrofolate reductase. 



19. An expression vector comprising the following 
15 components operatively spaced with respect to a desired 

gene: 

a) a DNA segment derived from a non-viral episome, 
said segment containing an origin for DNA 

20 replication; 

b) a DNA segment derived from a non-viral episome, 
said segment containing a DNA sequence which 
confers upon said vector the ability to be 

25 stably maintained extrachromosomally in a cell 

transf ected with said vector; 

c) a DNA segment containing a multiple cloning 
site ; 



d) a DNA segment conferring upon a cell transfected 
with said vector the ability to survive in the 
presence of a selected compound or selected 
group of compounds; 



35 
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e) a DNA segment: encoding bacterial components 
necessary for propagation of said vector In 
bacteria; and 

f ) a promoter, a polyadenylatlon site, and a splice 
site In special relation to allow the efficient 
expression of a structural gene upon insertion 
of said gene Into said splice site. 



10 



20. The 
compound is 
hygromycin B 



vector of claim 19 wherein said 
from the group consisting of G418 and 



21* The expression vector of claim 19 further 
including a DNA sequence encoding a desired gene* 



20 



22* The expression vector of claim 19 wherein the 
bacterial components necessary for propagation of said 
vector in bacteria are derived from pBR322, pUC, pT7/T3a 
18 or pKS. 



25 



23. The expression vector of claim 19 wherein the 
segment containing the origin for DNA replication is from 
an episome isolated from DiFi colorectal cell line. 



30 



35 



24. The expression vector of claim 19 wherein the 
segment containing a DNA sequence which confers upon said 
vector the ability to be stably maintained 
extrachromosomally in a cell transfected with said vector 
is from an episome isolated from DiFi colorectal cell 
line. 
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25. The expression vector of claim 19 wherein the 
chromosomal DNA of said transfected cell contains a 
mutation in an enzyme, said mutation rendering said 
5 selected compound or selected group of compounds toxic to 
said cell when said selectable marker segment is not 
present in said cell and wherein said selectable marker 
segment contains DNA encoding an enzyme capable of 
functionally replacing said mutated enzyme so as to render 
10 said transfected cell resistant to said selected compound 
or selected group of compounds. 



26. The expression vector of claim 25 wherein said 
15 enzyme is selected from the group consisting of: thymidine 

kinase, xanthine-guanine phosphoribosyl transferase, 
adenine phosphoribosyl transferase, adenosine deaminase and 
dihydrof olate reductase. 

20 

27. An expression vector comprising the following 
components operatively spaced with respect to a desired 
gene: 

25 a) a DNA segment derived from a non- viral episome, 

said segment containing an origin for DNA 
replication and a DNA sequence which confers 
upon said vector the ability to be stably 
maintained extrachromosomally in a cell 

30 transfected with said vector; 

b) a DNA segment containing a multiple cloning 



35 



c) 



a DNA segment conferring upon a cell transfected 
with said vector the ability to survive in the 
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pres nee of a selected compound or sel cted 
group of compounds; 

d) a DNA segment encoding bacterial components 
5 necessary for propagation of said vector in 

bacteria; and 

e) a promoter, a polyadenylation site, and a splice 
site in special relation to allow the efficient 

10 expression of a structural gene upon insertion 

of said gene into said splice site. 



28. The expression vector of claim 27 wherein said 
15 compound is selected from the group consisting of 6418 and 

hygromycin B. 



29. The expression vector of claim 27 further 
20 including a DNA sequence encoding a desired protein. 

30. The expression vector of claim 27 wherein the 
bacterial components necessary for propagation of said 

25 vector in bacteria are derived from pBR322, pUC, pT7/T3a- 

18 or pKS. 



31. The expression vector of claim 27 wherein the 
30 DNA segment containing an origin for DNA replication and a 

DNA sequence which confers upon said vector the ability to 
be stably maintained extrachromosomally in a cell 
trans feet ed with said vector is from an episome isolated 
from DiFi colorectal cell line. 



35 
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32. The expression vector £ claim 27 wherein th 
chromosomal DMA of said transfected cell contains a 
mutation in an enzyme, said mutation rendering said 
selected compound or selected group of compounds toxic to 

5 said cell when said selectable marker segment is not 

present in said cell and wherein said selectable marker 
segment contains DNA encoding an enzyme capable of 
functionally replacing said mutated enzyme so as to render 
said transfected cell resistant to said selected compound 
10 or selected group of compounds. 

33. The cloning vector of claim 32 wherein said 
enzyme is selected from the group consisting of: thymidine 

15 kinase , xanthine-guanine phosphor ibosyl transferase, 

adenine phosphoribosyl transferase, adenosine deaminase and 
dihydrof olate reductase. 



20 34. The expression vector of claim 19 or 27 wherein 

the promoter is selected from the group consisting of: 
cytomegalovirus promoter, SV-40 promoter, Rous sarcoma 
virus promoter, thymidine kinase promoter, beta-actin 
promoter, metallothionein promoter, and epidermal growth 

25 factor receptor gene promoter isolated from a DiFi 

episome. 



35. An artificial chromosome comprising: 
30 a DNA segment derived from a non-viral episome, said 

segment containing an origin for DNA replication, a DNA 
segment derived from a non-viral episome, said segment 
containing a DNA sequence which confers upon said vector 
the ability to be stably maintained extrachromosomally in 
35 a cell transfected with said vector, a DNA segment 

containing a multiple cloning site, a DNA selectable 
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10 



15 



marker segm nt conferring up n a cell transf ected with 
said vector, the ability to survive in the presence of a 
selected compound or selected group of compounds, a DNA 
segment encoding bacterial components necessary for 
propagation of said vector in bacteria, a promoter, a 
polyadenylation site, a splice site, a 0NA segment 
encoding a centromere and a DNA segment encoding a 
telomere. 



36. The artificial chromosome of claim 35 wherein 
said compound is selected from the group consisting of 
G418 and hygromycin B. 

37. The artificial chromosome of claim 35 further 
including a DNA sequence encoding a desired protein. 



20 38. The artificial chromosome of claim 35 wherein 

the chromosomal DNA of said transfected cell contains a 
mutation in an enzyme, said mutation rendering said 
selected compound or selected group of compounds toxic to 
said cell when said selectable marker segment is not 

25 present in said cell and wherein said selectable marker 

segment contains DNA encoding an enzyme capable of 
functionally replacing said mutated enzyme so as to render 
said transfected cell resistant to said selected compound 
or selected group of compounds. 

30 

39. The artificial chromosome of claim 38 wherein 
said enzyme is selected from the group consisting of; 
thymidine kinase, xanthine-guanine phosphoribosyl 
35 transferase, adenine phosphor ibosyltransf erase, adenosine 

deaminase and dihydrofolate reductase. 
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